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A speech signal is dpcm (differential pulse code modulation) en- 
coded at the sampling rate f s and also pcm encoded at a rate f„/W. 
Blocks of W dpcm words and one pcm word are transmitted. The 
receiver compares the decoded dpcm signal at the end of every block 
with the decoded pcm sample. If the difference is above a threshold, 
it is assumed that one error exists within the block, and a search is 
made for the erroneous dpcm code word. Correction is accomplished 
by inverting the bits in this code word until the difference is below 
the threshold. Whether or not the error is corrected, the dpcm signal 
at the end of the block is forced to the value of the pcm sample, 
thereby preventing error propagation outside the block. The system 
improves segmental s/n ratio by 7 dB for W = 64 and bit error rates 
between 0.1 and 0.5 percent. Larger improvements are available with 
smaller block sizes. In the absence of transmission errors, there is no 
perceptible distortion due to the correction system. 

I. INTRODUCTION 

We describe a new method of protecting dpcm (differential pulse 
code modulation) speech signals against the effects of transmission 
errors. The dpcm bit stream is divided into blocks, and one pcm sample 
is transmitted with each block. At the receiver, the appropriate sample 
of the integrated dpcm signal is compared with the pcm sample. A 
disparity between the two samples is evidence of a transmission error 
within the block. 

When an error is thus detected, the dpcm integrator is reset to the 
value of the pcm sample and an algorithm is invoked to locate the 
error within the block. When the algorithm is successful, the transmis- 
sion error is completely corrected. Even when the algorithm is unsuc- 
cessful, the resetting of the integrator at the receiver prevents the 
error from propagating outside the block in which it occurs. 
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This approach to error protection is different in spirit from conven- 
tional channel coding aimed at protecting a digital information stream 
regardless of its nature. Our method is directly keyed to the dpcm 
character of the message. Because it introduces its own redundancy to 
a dpcm signal, it is more powerful than the ddc (difference detection 
and correction) system described by the authors. 1 ' 2 ddc is implemented 
at the receiver only and infers transmission errors from anomalies 
within the integrated dpcm sample sequence. 

Other authors have reported on the periodic transmission of pcm 
code words in a dpcm picture coding system. 3 The pcm samples were 
used to update the receiver integrator and thereby curtail visible 
streaks caused by dpcm transmission errors. The use of pcm samples 
for error detection and correction is new to this paper. 

II. SYSTEM DEFINITION 
2. 1 The transmitter 

The transmitter (Fig. 1) sends one block of data every W sampling 
intervals. The data consist of the W code words of a conventional 
dpcm encoder plus one pcm code word formed by quantizing the input 
sample at the end of the block. In our implementation, the pcm and 
dpcm samples are formed by the same quantizer. As a consequence, 
for intermediate and high level inputs, the pcm signal is unable to code 
the entire range of signal amplitudes. Therefore, at the receiver, the 
error control mechanism is disabled when the pcm quantizer is over- 
loaded so that errors affecting high amplitude speech samples go 
uncorrected and are allowed to propagate beyond the blocks in which 
they occur. We accept this penalty in order to derive the convenience 




Fig. 1 — Transmitter. The dpcm and pcm bit streams are multiplexed to form the 
transmitted sequence. The symbol rate is f,( W + l)/W Hz. 
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Fig. 2 — Receiver. If, at the end of the block, the pcm sample and the dpcm sample 
differ by more than A, the correction logic is introduced. The switch in the integrator is 
in position 1 for W samples and goes to position 2 at the end of each block provided i 
is not an extreme value. 



of a system with one quantizer, because high amplitude samples occur 
with relatively low probability in speech, and because we anticipate 
that errors in the idle channel and low amplitude regions of the signal 
are the most damaging ones subjectively. 

2.2 The receiver 

During reception of the nth block of data (Fig. 2), the integrator 
generates W dpcm output words y n w+j, from the received dpcm 
sequence {g k } according to: 



Snw+j = ay„w+j-\ + q n \v+j, j = 1, 2, 



W. 



(1) 



These W samples are stored for possible revision by the error correc- 
tion algorithm. At the end of the block, jwnw is reset to the pcm 
sample x { „+ U w if x in+1) w is not the most positive or the most negative 
code word. When Xi n +i)W is at an extreme value, y<„+i>w remains set at 
the value computed from (1). 

2.2.1 Error detection 

Provided i ( „+i,wdoes not indicate quantizer overload, it is compared 
with y {n + 1 )w derived from (1). If these two samples differ by more than 
A, the quantizer step size, an error in the nth block is inferred and a 
search for the error is initiated. Otherwise the samples 



yn W+ 1 , y n VV+2 , 



yin+DW 



(2) 
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are sent to the system low-pass filter, 5wnw is reset to x in +\)w, and 
block n + 1 is processed. 

2.2.2 Error correction 

When \x (n+ i)w - 5wi>w| > A, the system scans the samples in its 
buffer and finds the largest sample-to-sample difference in the block. 
That is, it computes 

Sj = | y nW+J - ynw+j-i |, j - 1, 2, • • • , W 

and determines the maximum Sj. If this maximum occurs at the rth 
position in the block, the corrector modifies q r by successively inverting 
bits in the code word L r . With each bit inversion, q r , a version of q r , is 
formed and (1) is iterated to produce a new trial value of y {n +\)w, which 
we call y (n+ uw. Thus we have 

y n w+r = ay n w+ r -\ + q r 

and 

Snw+j = ay n w+j-\ + qnw+j, j = r + 1, r + 2, • • • , W. 

Now, if jwnw satisfies the test, 

iJ'ln+DW — £<n+l)iv| * A, 

the modified sequence 

y n W+\, • • • , j'nH'+r-l, SnW+r, • • • i >Wl)lV 

becomes the system output. Otherwise, a new value of q r is obtained 
by inverting another bit in L r . 

If no single-bit inversion in L r succeeds in bringing jwn w sufficiently 
close to x {n +i)w, the correction attempt ceases and the samples in (2) 
are sent to the output filter. 

Clearly, this correction scheme is effective only when there is one 
bit in error in a block of W samples and this single error leads to a 
large sample-to-sample difference. However, even when there is more 
than one error in the block, the updating of the integrator signal 
prevents long-term propagation of error effects. 

Errors in pcm samples induce distortions that would not be present 
in ordinary dpcm. Our performance evaluations indicate that the 
effects of these distortions are substantially smaller than the benefits 
of error correction and integrator updating. 

III. EVALUATION 

The technique was evaluated by means of computer simulation of a 
7-bit, 8-kHz single-integration dpcm system with prediction coefficient 
0.9. To obtain the objective measurements displayed in Figs. 3 to 5, we 
repeatedly played a single sentence through the system: "I have two 
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Fig. 3— -Performance as a function of input level, (a) No transmission errors. Segmen- 
tal s/n ratio reflects only quantizing noise. The curves with and without error protection 
virtually coincide, (b) Transmission error rate is 0.0014 and the system is unprotected 
(c) Error rate is 0.0014 and protection block size is 64. 



daughters, Lorna and Susan," spoken by a male. By listening to many 
samples of processed speech from a variety of talkers, we confirmed 
that the error protection indicated in these graphs is subjectively 
meaningful and not peculiar to a single utterance. 

For each system configuration, we measured segmental signal-to- 
noise ratio, 4 defined as the decibel average of the signal-to-noise ratios 
in 214 speech segments, each of duration 16 ms. These are the segments 
(out of 224 in the 3.5-second utterance) in which the rms signal level 
exceeds -60 dB relative to the peak signal. Segmental s/n ratio is 
considered a better indicator of speech quality than ordinary s/n ratio. 
In a study of adaptive dpcm, there was a correlation of 0.93 between 
segmental s/n ratio and subjective ratings of speech quality. 5 The 
comparable correlation with ordinary s/n ratio was only 0.69. 

Variables in our experiment were input signal level, block size W, 
and transmission error rate. The simulated channel introduces random 
errors to the serial bit stream consisting of W 7-bit dpcm code words 
and one 7-bit pcm code word per block. 

Figure 3 shows segmental s/n ratio as a function of input level for a 
block size W = 64 and for unprotected dpcm. The top curve is for zero 
error rate, in which case s/n ratio is the same (to within 0.3 dB) with 
and without the error protection mechanism. Curve b shows the effect 
on dpcm of a channel with error rate 0.0014. In the granular noise 
region (signal level below the peak of curve a), these errors cause a 
degradation of about 13 dB in s/n ratio. Over most of this region, the 
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Fig. 4 The effect of block size on performance. The relative input level is dB and 

the error rate is 0.0014. Over most of the range, s/n ratio decreases by about 2 dB per 
octave increase in block size. 



protection system redeems about 9 dB of this loss. For reasons dis- 
cussed in Section 2.1, the effectiveness of the error protection dimin- 
ishes as signal level increases. 

Figure 4 pertains to the 0-dB relative input level (see Fig. 3) and 
0.0014 error rate. It shows the effect on segmental s/n ratio of varying 
the block size, W, over the range 8 to 128 samples. A small block size 
offers more protection but exacts a greater penalty in transmitted bit 
rate than a large block size. With W = 8 there is almost one extra bit 
per code word and an improvement of 11.5 dB in s/n ratio. The 
improvement decreases by about 2 dB per octave change in W over 
the range we investigated. 

The effectiveness of the error protection as a function of channel 
quality is shown in Fig. 5 for the input level dB and W = 64. Our 
error protection mechanism results in an s/n ratio improvement of 
about 6 to 8 dB at error rates between 0.001 and 0.01. For comparison, 
we also show the performance of a dpcm system protected by the ddc 
(difference detection and correction) scheme. 1,2 ddc requires no mod- 
ification of the dpcm transmitter and no additional transmitted bits. 
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Fig. 5— Performance as a function of channel error rate, dB relative input level. 
Here, the block protection scheme is compared with ordinary dpcm and with dpcm 
augmented at the receiver by ddc, a difference detection and correction system. 
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Fig. 6— Oscillograms of a segment of the test utterance, (a) Original speech, dB 
relative input level, (b) dpcm output with 0.0042 error per bit. (c) After block protection 
with W= 64. (d) After selective smoothing of (c) by means of ddc. 
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Its performance, however, is about 2 dB poorer than that of the new 
scheme. 

IV. REPROCESSING THE PARTIALLY CORRECTED SPEECH 

The forced updating of the integrator signal itself causes sharp 
transients in the system output when a detected error cannot be 
corrected. Thus, the output signal often contains many spurious spikes. 
This phenomenon is illustrated by the speech waveforms in Fig. 6. A 
segment of the original speech signal is shown in Fig. 6a, and Fig. 6b 
shows the same segment corrupted by transmission errors (which 
occurred with probability 0.0042). Figure 6c shows the output of the 
protected dpcm system, which has suppressed most of the channel 
error noise but left a residual spike (click) in the signal, ddc 6 (as 
applied to pcm) is designed to smooth out such spikes and the effect 
of reprocessing the system output with ddc is seen in Fig. 6d. In 
general, appending ddc to the forced updating method is effective in 
suppressing residual impulse noice. 

V. CONCLUSIONS 

By periodically introducing pcm samples to a dpcm signal sequence, 
it is possible to reduce substantially the propagation of transmission 
errors in dpcm. Furthermore, at the cost of some delay, storage, and 
elementary signal processing in the receiver, many errors can be 
completely corrected. Small block sizes (frequent pcm transmissions) 
are more effective but introduce larger transmission rate penalties 
than large block sizes. 

We have shown that this block protection scheme improves segmen- 
tal s/n ratio of speech signals, and our experience of listening to several 
speech samples processed this way confirms that subjective quality is 
correspondingly enhanced. The method is also very appropriate to 
dpcm transmission of video signals in which error propagation causes 
very objectionable streaks in pictures. 3 
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